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BACKGROUND OF THE INVENTION 
Humans have exploited the use of genetics and microorganisms for 
their own advantage throughout much of recorded history. Egyptians are credited 

10 with the first use of yeast to produce leavened bread sometime between 4000-2000 
BC, Gregor Mendel produced peas having specific, defined, characteristics in the 
mid-19th century, and the Food and Drug Administration approved the first 
recombinant drug, human insulin, in 1982. This last feat is often considered to be the 
beginning of the modern biotechnology industry. Since then, transgenic plants, 

15 recombinant foods, recombinant vaccines, cancer therapeutics, recombinant 

antibodies, enzymes, glycosyltransferases, cytokines, coagulation factors, hormones, 
dermal replacements, anti-virals, and many other recombinant proteins have been 
developed for human use. 

The nucleic acid expression vector has greatly aided in the production 

20 of recombinant proteins and therapeutics. A nucleic acid encoding a protein reagent 
or therapeutic protein can be cloned into an expression vector, which can be expressed 
in a population of eukaryotic and/or prokaryotic cells, thus producing a large amount 
of a recombinant protein or therapeutic. However, the yield and quality of the 
recombinant product depend greatly on the expression vector and microorganism used 

25 to express the vector. In addition, the use of recombinant cells can be slow and tax 
resources that can be otherwise used for discovery and improvement of recombinant 
proteins and therapeutics. 

As the demand and usefulness of recombinant proteins increases, new 
methods are required in order to more efficiently prepare such proteins with a rapid 

30 turnaround time. Moreover, as recombinant proteins for the treatment of a variety of 
diseases are generated, methods to lower the cost of their production need to be 
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implemented so that these technologies are available to all those in need. The need to 
provide improved vectors for protein expression does not exist solely in the 
therapeutic protein arena. Rather, this need also extends to the production of proteins 
or reagents(e.g., enzymes) for use in the production of both protein and non-protein 
5 therapeutics. 

Over the past several decades, recombinant proteins and therapeutics 
have proven to be the answer in treating many diseases that were not addressed using 
conventional, chemical therapeutics. However, recombinant technology has been 
hampered by inefficiency, especially in small scale situations, as well as high cost and 
10 slow turnaround time. Streamlining the expression of proteins at a lower cost with a 
quicker turnaround time for virtually any customer situation is needed to realize the 
potential of recombinant proteins as reagents and as therapeutics. The present 
invention meets this need. 

15 BRIEF SUMMARY OF THE INVENTION 

The invention includes a method of providing a therapeutic protein to a 
customer, comprising cloning a nucleic acid encoding a protein into a pCWinl 
expression vector as set forth in SEQ ID NO:l, expressing a protein therefrom, and 

20 providing the protein to a customer. 

In another aspect of the invention, a method of providing a therapeutic 
protein to a customer comprises cloning a nucleic acid encoding a protein into a 
pCWin2 expression vector as set forth in SEQ ID NO:2, expressing a protein 
therefrom, and providing the protein to a customer. 

25 In yet another aspect of the invention, a method of providing a 

therapeutic protein to a customer comprises cloning a nucleic acid encoding a protein 
into a pCWin2/MBP expression vector as set forth in SEQ ID NO:3, expressing a 
protein therefrom, and providing the protein to a customer. In still another aspect, a 
method of providing a therapeutic protein to a customer comprises cloning a nucleic 

30 acid encoding a protein into a pCWin2-MBP~SBD (pMS 3 9) expression vector as set 
forth in SEQ ID NO: 10, expressing a protein therefrom, and providing the protein to a 
customer. In yet another aspect of the invention, a method of providing a therapeutic 
protein to a customer comprises cloning a nucleic acid encoding a protein into a 
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pCWin2-MBP-MCS-SBD (pMXS 3 9) expression vector as set forth in SEQ ID NO: 11, 
expressing a protein therefrom, and providing the protein to a customer. 

In an embodiment of the invention, a pCWIN2/MBP vector comprises 
a protease cleavage site coding sequence between the MBP coding sequence and the 
5 therapeutic protein coding sequence. 

Therapeutic proteins useful in the present invention include 
erythropoietin, human growth hormone, granulocyte colony stimulating factor, 
interferons alpha, -beta, and -gamma, Factor IX, follicle stimulating hormone, 
interleukin-2, erythropoietin, anti-TNF-alpha, and lysosomal hydrolases such as beta- 
10 glucosidase, alpha-galactosidase-A, beta-hexosaminidase, beta-galactosidase, alpha- 
galactosidase, alpha-mannosidase, beta-mannosidase, alpha-L-fucosidase, beta- 
glucuronidase, alpha-glucosidase, alpha-N-acetylgalactosaminidase, and acid 
phosphatase. 

In one embodiment of the invention, a method of providing a protein to 
15 a customer includes cloning a nucleic acid encoding a protein into a pCWinl 

expression vector as set forth in SEQ ID NO:l, expressing a protein therefrom, and 

providing the protein to a customer. 

In another embodiment of the invention, a method of providing a 

protein to a customer includes cloning a nucleic acid encoding a protein into a 
20 pCWin2 expression vector as set forth in SEQ ID NO:2, expressing a protein 

therefrom, and providing the protein to a customer. 

Li another embodiment of the invention, a method of providing a 

protein to a customer includes cloning a nucleic acid encoding a protein into a 

pCWin2/MBP expression vector as set forth in SEQ ID NO:3, expressing a protein 
25 therefrom, and providing the protein to a customer. In still another aspect, a method of 

providing a protein to a customer comprises cloning a nucleic acid encoding a protein 

into a pCWin2-MBP-SBD (pMS 3 9> expression vector as set forth in SEQ ID NO: 10, 

expressing a protein therefrom, and providing the protein to a customer. In yet 

another aspect of the invention, a method of providing a protein to a customer 
30 comprises cloning a nucleic acid encoding a protein into a pCWin2-MBP-MCS-SBD 

(pMXS 3 9> expression vector as set forth in SEQ ID NO: 11, expressing a protein 

therefrom, and providing the protein to a customer. 

In one aspect of the invention, a protein may be a glycosyltransferase 

or a sugar nucleotide-generating enzyme. 
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In an aspect of the invention, an expression vector includes an affinity 
tag coding sequence. In this aspect of the invention, an affinity tag may be a histidine 
tag, a Factor IX tag, a glutathione-S -transferase tag, starch-binding domain and a 
FLAG-tag. 

5 The invention includes a method of providing a therapeutic protein to a 

customer, where the method includes providing an expression vector to a protein 
production facility wherein a nucleic acid encoding a protein is cloned into the 
expression vector and the protein is expressed therefrom in the protein production 
facility; subsequently providing the protein to a customer. In an aspect of the 
10 invention, the expression vector comprises a multiple-cloning region and an antibiotic 
resistance marker. The antibiotic resistance marker may be kanamycin, tetracycline, 
or chloramphenicol. In another aspect of the invention, the expression vector includes 
an affinity tag. 

In an embodiment of the invention, a method of providing a protein to 

15 a customer includes providing a pCWinl vector as set forth in SEQ ID NO: 1 to a 
protein production facility, wherein a nucleic acid encoding a protein is cloned into 
the expression vector and the protein is expressed therefrom in the protein production 
facility, and the protein is provided to a customer. 

In an embodiment of the invention, a method of providing a protein to 

20 a customer includes providing a pCWin2 vector as set forth in SEQ ID NO:2 to a 
protein production facility, wherein a nucleic acid encoding a protein is cloned into 
the expression vector and the protein is expressed therefrom in the protein production 
facility, and the protein is provided to a customer. 

In an embodiment of the invention, a method of providing a protein to 

25 a customer includes providing a pCWin2/MBP vector as set forth in SEQ ID NO:3 to 
a protein production facility, wherein a nucleic acid encoding a protein is cloned into 
the expression vector and the protein is expressed therefrom in the protein production 
facility, and the protein is provided to a customer. In another embodiment of the 
invention, a method of providing a protein to a customer includes providing a 

30 pCWin2-MBP-SBD (pMS 39 ) vector as set forth in SEQ ID NO: 10 to a protein 
production facility, wherein a nucleic acid encoding a protein is cloned into the 
expression vector and the protein is expressed therefrom in the protein production 
facility, and the protein is provided to a customer. In still another embodiment of the 
invention, a method of providing a protein to a customer includes providing a 
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pCWin2-MBP-MCS-SBD (pMXS 3 9> vector as set forth in SEQ ID NO: 11 to a protein 
production facility, wherein a nucleic acid encoding a protein is cloned into the 
expression vector and the protein is expressed therefrom in the protein production 
facility, and the protein is provided to a customer. 
5 In one aspect of the invention, a protein production facility is in-house. 

In another aspect of the invention, the protein production facility is offsite. 

The invention also includes a method of providing a protein to a 
customer, wherein at least one glycosyl moiety is added to a protein prior to providing 
the protein to a customer. In one aspect, a glycosyl moiety is added to a protein in 
10 vitro. 

The present invention includes a method of providing a protein to a 
customer, comprising cloning a nucleic acid encoding said protein into a pCWinl 
expression vector as set forth in SEQ ID NO:l, inserting the vector into a bacterial 
host cell, expressing the protein in the host cell, and providing the protein to a 
15 customer. 

Another embodiment of the invention includes a method of providing a 
protein to a customer, comprising cloning a nucleic acid encoding said protein into a 
pCWin2 expression vector as set forth in SEQ ID NO:2, inserting the vector into a 
bacterial host cell, expressing the protein in the host cell, and providing the protein to 
20 a customer. 

Another embodiment of the invention includes a method of providing a 
protein to a customer, comprising cloning a nucleic acid encoding said protein into a 
pCWin2/MBP expression vector as set forth in SEQ ID NO:3, inserting the vector 
into a bacterial host cell, expressing the protein in the host cell, and providing the 

25 protein to a customer. Yet another embodiment of the invention includes a method of 
providing a protein to a customer, comprising cloning a nucleic acid encoding said 
protein into a pCWin2-MBP-SBD (pMS 39 ) expression vector as set forth in SEQ ID 
NO: 10, inserting the vector into a bacterial host cell, expressing the pr6tein in the host 
cell, and providing the protein to a customer. Still another embodiment of the 

30 invention includes a method of providing a protein to a customer, comprising cloning 
a nucleic acid encoding said protein into a pCWin2-MBP-MCS-SBD (pMXS 3 9) 
expression vector as set forth in SEQ ID NO: 11, inserting the vector into a bacterial 
host cell, expressing the protein in the host cell, and providing the protein to a 
customer. 
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In one embodiment of the invention, a method includes adding at least 
one glycosyl moiety to a protein prior to providing the protein to a customer. In one 
aspect, a glycosyl moiety is added to a protein in vitro. 

The invention features isolated pcWINl expression vector comprising 
5 the sequence set forth in SEQ ID NO:l. The invention also features an isolated 
pcWINl expression vector consisting of the sequence set forth in SEQ ID NO:l. 

In another aspect, the invention features an expression vector 
comprising the sequence set forth in SEQ ID NO:2. The invention also features an . 
isolated pcWIN2 expression vector consisting of the sequence set forth in SEQ ID 
10 NO:2. 

In yet another aspect, the invention features an isolated pcWIN2/MBP 
expression vector comprising the sequence set forth in SEQ ID NO:3. The invention 
also features an isolated pcWIN2/MBP expression vector consisting of the sequence 
set forth in SEQ ID NO:3. The invention further features a pcWIN2/MBP expression 

15 vector, wherein the pCWIN2/MBP vector comprises a protease cleavage site coding 
sequence adjacent to the MBP coding sequence. 

In another aspect, the invention features an isolated pCWin2-MBP- 
SBD (PMS39) expression vector comprising the sequence set forth in SEQ ID NO: 10. 
The invention also features an isolated pCWin2-MBP-SBD (pMS 3 9> expression vector 

20 consisting of the sequence set forth in SEQ ID NO: 10. 

In still another aspect, the invention features an isolated pCWin2- 
MBP-MCS-SBD (pMXS 39 ) expression vector comprising the sequence set forth in 
SEQ ID NO:l 1. The invention also features an isolated pCWin2-MBP-MCS-SBD 
(PMXS39) expression vector consisting of the sequence set forth in SEQ ID NO: 11. 

25 The invention features a method of expressing a protein from an 

isolated pcWINl expression vector comprising the sequence set forth in SEQ ID 
NO:l. In another embodiment, the invention features a method of expressing a 
protein from an isolated pcWIN2 expression vector comprising the sequence set forth 
in SEQ ID NO:2. In yet another embodiment, the invention features a method of 

30 expressing a protein from an isolated pcWIN2/MBP expression vector comprising the 
sequence set forth in SEQ ID NO:3. In still another embodiment, the invention 
features a method of expressing a protein from an isolated pCWin2-MBP-SBD 
(PMS39) expression vector comprising the sequence set forth in SEQ ID NO: 10. In 
another embodiment, the invention features a method of expressing a protein from an 
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isolated pCWin2-MBP-MCS-SBD (pMXS 39 ) expression vector comprising the 
sequence set forth in SEQ ID NO: 11. In one aspect, the protein is expressed in a 
prokaryotic cell. 



5 BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing summary, as well as the following detailed description 
of the invention, will be better understood when read in conjunction with the 
appended drawings. For the purpose of illustrating the invention, there are shown in 
the drawings embodiment(s) which are presently preferred. It should be understood, 
10 however, that invention is not limited to the precise arrangements and 
instrumentalities shown. In the drawings: 

Figure 1 A is an image of an electrophoretic gel containing products of 
a restriction digest. Lanes 1 and 3 are BstEII DNA Marker, lane 2 is Sacl/Xbal- 
digested Cst-04 vector and lane 4 is Kan r PCR product digested with Sacl/Xbal. 
15 Figure IB is an image of an agar plate showing the result of Cst-04- 

Kan r transformation plated on LB kan r plate. 

Figure 1C is an image of an electrophoretic gel containing DNA from 
an E. coli colony that screened positive for the Cst-04-Kan r insert. Lane 1 contains 
BstEII DNA Marker. Lanes 2-4 contain DNA isolated from the Cst-04-Kan5 colony. 
20 Lane 2 contains DNA cut with Ndel, lane 3 contains DNA cut with Sail, arid lane 4 
contains DNA cut with Pstl . 

Figure ID is an image of an ampicilhn-containing agar plate and a 
kanamycin-containing agar plate, on both of which Cst-04-Kan5 was streaked. The 
ampicillin-containing plate inhibited the growth of Cst-04-Kan5-containing cells, 
25 demonstrating that the ampicillin gene in Cst-04-Kan5-containing cells is inactive, 
whereas the kanamycin-containing plate supports the growth of Cst-04-Kan5- 
containing cells, demonstrating that the kanamycin gene in Cst-04-Kan5-containing 
cells is operative. 

Figure IE is an image of thin-layer chromatography of the products of 
30 the activity of Cst-04Kan5 plasmid-containing cell lysates using lacto-N-neotetraose 
as a substrate. Lanes labeled 1 and 2 are Cst-04Kan5 from JM109 cells, lanes labeled 
3 are Cst-04Kan5 isolated from TGI cells, and lanes labeled 4 are Cst-04-6-1. 

Figure 2A is an image of the agarose gel from which restriction 
enzyme-digested PCR products were isolated. Lanes marked "M" contain 1 kb DNA 
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markers, lane 1 contained pCWINl insert, lane 2 contained Ndel/Scal-digested 
pCWori Kan r Cst04Kan5 vector, lane 3 contained pre-pCWIN2 insert, and lane 4 
contained BamHI/EcoRI-digested pCWori Kan r Cst04Kan5 vector. 

Figure 2B is an image of an electrophoretic gel, illustrating the results 
5 of restriction digestion of plasmid DNA isolated from positive transformants as a 
result of pCWinl and pre-pCWin2 DNA mini-prep. Lanes labeled "M" contain 1 kb 
DNA markers. Lanes 1 to 5 contain pCWinl clones. Lanes 6 to 14 contains pre- 
pCWin2 clones. All 14 clones were digested with EcoRl. 

Figure 2C is an image of an electrophoretic gel, illustrating the results 
10 of restriction digestion of plasmid DNA isolated from positive transformants as a 

result of pCWinl and pre-pCWin2 DNA mini-prep. Lanes labeled "M" contain 1 kb 
DNA markers. Lane 1 contains pCWinl clone #5, lane 2 contains pre-pCWin2 clone 
#11. Both clones # 5 and #1 1 were digested with Ndel and Seal. 

Figure 2D is an image of an electrophoretic gel, illustrating the results 
15 of restriction pCWin2 mini-prep screening. Lanes labeled "M" contains 1 kb DNA 
markers. Lanes 1 through 18 contain pCWin2 clones. The clones were all digested 
with Pstl. 

Figure 3A is an image of an electrophoretic gel containing the 

NdelZBamHLdigested malE cDNA. 
20 Figure 3B is an image of an electrophoretic gel containing the 

restriction enzyme-digested pCWin2 vector. 

Figure 3C is an image of two electrophoretic gels containing the 

restrction enzyme-digested pCWin2 vector. This figure represents the screening of 

colonies to verify that the malE Ndel and BamHI insert size was correct. The first 
25 lane on each gel contains 1 kb DNA molecular weight markers, as indicated in the 

figure. Lanes 1, 2, 3, 4, 5, 7, 8, 9 and 10 correspond to colonies selected from the 

transformation plate and which positively show the presence of the malE cDNA. 

Lane 6 corresponds to a colony selected from the transformation plate bearing a 

vector that does not contain the malE insert. 
30 Figures 4A, 4B, and 4C comprise the entire nucleotide sequence of 

pcWINl, as set forth in SEQ ID NO:l. 

Figures 5A, 5B, and 5C comprise the entire nucleotide sequence of 

pcWIN2, as set forth in SEQ ID NO:2. 
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Figiires 6 A, 6B, 6C, and 6D comprise the entire nucleotide sequence of 
pcWIN2/MBP, as set forth in SEQ ID NO:3. 

Figure 7A is an image of an electrophoretic gel illustrating the results 
of a restriction enzyme-digested PCR reaction used to create the SBD 39 insert. Lane 
5 M is a lkb DNA marker. Lane 1 is the SBD39 PCR insert product digested with Bgl H 
and BamHl. The expected size for the SBD39 insert is 447bp. 

Figure 7B is an image of an electrophoretic gel illustrating the result of 
the restriction enzyme digestion of pCWin2-MBP-ST3Gal m (A73) vector. Lane M is 
a lkb DNA marker. Lane 1 is pCWin2-MBP-ST3Gal m (A73) digested with BamHl. 
10 The expected size for the vector is 7 kb. 

Figure 7C is an image of an electrophoretic gel illustrating the results 
of the DNA mini-prep enzymatic digestion screen of pCWin2-MBP-SBD-ST3Gal m 
(A73). Lanes M are a lkb DNA marker, lanes 1 through 11 are pCWin2MBP-SBD- 
ST3Gal HI A73 construct colonies 1 through 11 respectively, digested with Ndel and 
15 BamHl. The expected size for the pCWin2-ST3Gal III A73 vector band is 5.9 kb and 
the expected size for the MBP-SBD insert is 1.6 kb. Clone #6 (Lane 6) illustrates a 
positive result. 

Figure 7D is an image of an electrophoretic gel illustrating the 
restriction enzyme digestion of pCWin2 vector with BamHl and Seal. Lane M is a 

20 lkb DNA marker. Lane 1 is digested pCWin2. The expected size for the vector is 4.3 
kb and the expected size for the BamH I /Sea I MCS insert is 0.8 kb. 

Figure 7E is an image of an electrophoretic gel illustrating the results 
of the restriction enzyme digestion of pCWin-MBP-SBD 39 -ST3Gal m (A73). Lane M 
is a lkb DNA marker. Lane 1 is pCWin-MBP-SBD 3 9-ST3Gal IH (A73) digested with 

25 BamH I and Sea I. The expected size for the vector is Linear is pCWin-MBP-SBD39 
is 5.8 kb and the expected size for the BamH I /Sea I ST3Gal III (A73) insert is 1.6 
kb. 

Figure 7F is an image of an electrophoretic gel illustrating the results 
of the DNA mini-prep restriction enzyme digestion screen of pCWin2-MBP-SBD39 
30 clones. Lanes M is a lkb DNA marker, lanes 1 through 16 are pCWin2-MBP-SBD39 
construct colonies 1 through 16 respectively, digested with Nde I and Xba I. The 
expected size for the pCWin2 vector band is 5.0 kb and the expected size for the 
MBP-SBD insert is 1.65 kb. Clone #1 (Lane 1 ) illustrates a positive result. 
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Figure 7G is a nucleic acid vector feature map, illustrating restriction 
sites for the pCWin-MBP-SBD 39 (pMS 39 ) construct. 

Figure 8 A is an electrophoretic gel illustrating the results of the PCR 
reaction to prepare the SBD insert. Lane Mis A, BstE II DNA marker, lane 1 is SBD 
5 PCR insert product. The expected size for the SBD insert is 447bp. 

Figure 8B is an image of an electrophoretic gel illustrating the results 
of DNA isloated from PCR-Blunt-SBD colonies and subjected to restriction enzyme 
digestion. Lane M is a X BstE II DNA marker, lanes 1 through 8 are PCR-Blunt-SBD 
colonies 13, 14, 15, 17, 18, 19, 20, and 22 respectively, all digested with Xhol and 
10 Sail. The expected size for the PCR-Blunt vector band is 3kb and the expected size 
for the SBD insert is 447bp. 

Figure 8C is an image of an electrophoretic gel illustrating the results 
of the restriction enzyme-digested pCWin2-MBP kan r vector. Lane M is a Ikb DNA 
ladder, lane 1 is pCWin2-MBP kan r Vector digested with Xhol and Sal L The 
15 expected size for the pCWin2-MBP kan r vector band is 3kb. 

Figure 8D is an image of an electrophoretic gel illustrating the results 
of DNA isolated from pCWin2-MBP~MCS-SBD (pMXS 39 ) vector-containing 
colonies. Lane Mis a A. BstE II DNA marker, lanes 1 through 13 are pMXS39 
colonies 1 through 13 respectively, digested with Xhol and Sail. The expected size 
20 for the pCWin2-MBP vector band is 6. Ikb and the expected size for the SBD insert is 
447bp. Two out of thirteen colonies had the correct size of insert and pMXS 39 vector 
(see lanes 4 and 5 in Figure 8D). 

Figure 8E is a nucleic acid vector feature map, illustrating restriction 
sites for the pCWin-MBP-MCS-SBD 39 (pMXS 39 ) construct. 

25 

DETAILED DESCRIPTION OF THE INVENTION 
The use of therapeutic proteins to treat patients experiencing disease or 
illness increases yearly. Protein therapeutics typically lack the same problematic side 
effects found with certain traditional chemical therapeutics. Even in instances where 
30 the protein therapeutic is altered slightly from its natural state, such a protein typically 
does not have the same side effects as do certain chemical therapeutics. Similar to the 
increase in the use of therapeutic proteins, the use of non-therapeutic, or "reagent" 
proteins increases exponentially from year to year. For example, reagent proteins are 
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used in such areas as food biochemistry, bioremediation, production of small 
molecule therapeutics, and even in the production of therapeutic proteins. 

The increasing use of protein reagents and therapeutics has enhanced 
the need for production and preparation of such proteins. It is generally impractical, 
5 in terms of cost and time, to isolate and purify a protein therapeutic from its natural 
source. The cost of isolating proteins from natural sources is prohibitive, and the 
amount of time needed for such isolation techniques is lengthy. For example, a 
difficult and time-consuming process for isolating a therapeutic protein from a natural 
source will drive up the cost of that reagent or therapeutic, which in the latter 

10 instance, may unduly burden a medical patient, the patient's insurer, or both. Further, 
a burdensome isolation process can limit the amount of therapeutic protein available 
to those in need thereof. Finally, a difficult isolation process can also overburden the 
entity that produces the reagent or therapeutic protein, reducing profits and wasting 
valuable business time. 

15 In vitro systems have therefore been developed to produce 

recombinant forms of reagent proteins and therapeutic proteins. One of the most 
significant groups of organisms used as an in vitro system for production of 
recombinant therapeutic proteins is bacteria, and in particular, Escherichia coli. E. 
coli is often used for its simplicity, as it is easy to culture and to maintain, and more 

20 importantly, it is easy to manipulate genetically. Further, it is relatively simple to 
isolate protein expressed from E. coli. 

There are numerous expression vectors that are compatible with 
bacteria, and in particular, with E. coli, for the purpose of producing recombinant 
therapeutic proteins. However, many vectors are useful only under particular 

25 circumstances, and therefore have drawbacks with respect to their utility for selected 
protein expression under the specific circumstances that may be required. The present 
invention sets forth methods of providing a protein to a customer that overcome some 
of the difficulties associated with commercial protein production. 

The present invention therefore features a method of providing a 

30 protein to a customer, wherein the protein of interest is expressed in a vector 
containing a nucleic acid encoding the protein of interest, as well as a multiple 
cloning site and an antibiotic resistance marker, and further wherein the resulting 
protein is provided to a customer. Part of the advantage of the present invention is 
that the expression vectors of the present invention reduce the complexity of 
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subcloning the cDNA encoding a therapeutic protein. Further, the expression vector 
of the present invention enables the production of proteins using an antibiotic 
resistance marker other than-the ampicillin antibiotic resistance marker, which is not 
approved for Good Manufacturing Practice (GMP) protocols required by the Food 
5 and Drug Administration. 

Part of the advantage of the present invention is due to the flexibility of 
the expression vector used to express the protein. The flexibility of a vector of the 
present invention provides that a protein can be produced rapidly and efficiently. A 
cDNA encoding a protein of interest can be readily subcloned into the expression 

10 vector by way of a multiple cloning site. Therefore, a method of the present invention 
features the use of an expression vector as described above to provide a protein to a 
customer in a more efficient manner. 

Another advantage of the present invention is that expression vectors 
of the invention offer increased productivity and efficiency of protein expression. 

15 That is, the design and use of vectors of the present invention provide increased levels 
of protein expression and production, leading to increased efficiencies in protein 
expression over similar vectors known in the art. Such advantages increase the 
quantity of protein produced, and therefore also serve to lower the cost of protein 
production and increase profit through sales of protein. 

20 The flexibility and functionality of a vector of the present invention 

increases the ease, efficiency and reliability of the delivery of a protein to a customer. 
The use of a method of the present invention to streamline and enhance protein 
product delivery to a customer not only increases the production and profitability of a 
business entity using such methods, but it also has the effect of increasing the 

25 opportunity for medical patients in need thereof with a therapeutic protein. 

The present invention also features vectors for expression of reagent 
proteins, and methods of providing reagent proteins produced using such vectors to a 
customer. Vectors of the invention useful for the expression of reagent proteins may 
be the same vectors used to produce therapeutic proteins in methods of the invention. 

30 Additionally, vectors of the invention useful for the expression of reagent proteins 

may be different vectors than those used to produce therapeutic proteins in methods of 
the invention. 

Vectors of the invention designed for production of reagent proteins 
are further useful for production of proteins that are themselves useful in the 
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subsequent production of small chemical therapeutics and for remodeling of non- 
protein molecules, such as carbohydrates. Examples of such proteins include 
glycosyltransferases, glycosidases and enzymes used in the production of sugar 
nucleotides. Further, such proteins are useful to produce and remodel carbohydrate- 
5 containing glycoproteins. The production and remodeling of glycoproteins has 

significant therapeutic impact, as glycoproteins form the basis of a significant number 
of recombinant therapeutics. 

Definitions 

10 Unless defined otherwise, all technical and scientific terms used herein 

have the same meaning as commonly understood by one of ordinary skill in the art to 
which this invention belongs. Although any methods and materials similar or 
equivalent to those described herein can be used in the practice or testing of the 
present invention, the preferred methods and materials are described herein. 

15 As used herein, each of the following terms has the meaning associated 

with it in this section. 

The articles "a" and "an" are used herein to refer to one or to more than 
one (i.e. to at least one) of the grammatical object of the article. By way of example, 
"an element" means one element or more than one element. 

20 "Encoding" refers to the inherent property of specific sequences of 

nucleotides in a polynucleotide, such as a gene, a cDNA, or an mRNA, to serve as 
templates for synthesis of other polymers and macromolecules in biological processes 
having either a defined sequence of nucleotides (i.e., rRNA, tRNA and mRNA) or a 
defined sequence of amino acids and the biological properties resulting therefrom. 

25 Thus, a gene encodes a protein if transcription and translation of mRNA 

corresponding to that gene produces the protein in a cell or other biological system. 
Both the coding strand, the nucleotide sequence of which is identical to the mRNA 
sequence and is usually provided in sequence listings, and the non-coding strand, used 
as the template for transcription of a gene or cDNA, can be referred to as encoding the 

30 protein or other product of that gene or cDNA. 

A "coding region" of a gene consists of the nucleotide residues of the 
coding strand of the gene and the nucleotides of the non-coding strand of the gene 
which are homologous with or complementary to, respectively, the coding region of 
an mRNA molecule which is produced by transcription of the gene. 
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A "coding region" of an mRNA molecule also consists of the 
nucleotide residues of the mRNA molecule which are matched with an anticodon 
region of a transfer RNA molecule during translation of the mRNA molecule or 
which encode a stop codon. The coding region may thus include nucleotide residues 
5 corresponding to amino acid residues which are not present in the mature protein 
encoded by the mRNA molecule {e.g., amino acid residues in a protein export signal 
sequence). 

An "affinity tag" is a peptide or polypeptide that may be genetically or 
chemically fused to a second polypeptide for the purposes of purification, isolation, 
10 targeting, trafficking, or identification of the second polypeptide. The "genetic" 

attachment of an affinity tag to a second protein may be effected by cloning a nucleic 
acid encoding the affinity tag adjacent to a nucleic acid encoding a second protein in a 
nucleic acid vector. 

As used herein, the term "glycosyltransferase," refers to any 
15 enzyme/protein that has the ability to transfer a donor sugar to an acceptor moiety. 

A "sugar nucleotide-generating enzyme" is an enzyme that has the 
ability to produce a sugar nucleotide.. Sugar nucleotides are known in the art, and 
include, but are not limited to, such moieties as UDP-Gal, UDP-GlcNAc, and CMP- 
NAN. 

20 An "isolated nucleic acid" refers to a nucleic acid segment or fragment 

which has been separated from sequences which flank it in a naturally occurring state, 
e.g.,, a DNA fragment which has been removed from the sequences which are 
normally adjacent to the fragment, e.g., the sequences adjacent to the fragment in a 
genome in which it naturally occurs. The term also applies to nucleic acids which 

25 have been substantially purified from other components which naturally accompany 
the nucleic acid, e.g., RNA or DNA or proteins, which naturally accompany it in the 
cell. The term therefore includes, for example, a recombinant DNA which is 
incorporated into a vector, into an autonomously replicating plasmid or virus, or into 
the genomic DNA of a prokaryote or eukaryote, or which exists as a separate 

30 molecule (e.g, as a cDNA or a genomic or cDNA fragment produced by PCR or 
restriction enzyme digestion) independent of other sequences. It also includes a 
recombinant DNA which is part of a hybrid gene encoding additional polypeptide 
sequence. 
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In the context of the present invention, the following abbreviations for 
the cpmmonly occurring nucleic acid bases are used. 11 A" refers to adenosine, "C" 
refers to cytidine, "G" refers to guanosine, "T" refers to thymidine, and "U" refers to 
uridine. 

5 A "polynucleotide" means a single strand or parallel and anti-parallel 

strands of a nucleic acid. Thus, a polynucleotide may be either a single-stranded or a 
double-stranded nucleic acid. 

The term "nucleic acid" typically refers to large polynucleotides. 

The term "oligonucleotide" typically refers to short polynucleotides, 
10 generally no greater than about 50 nucleotides. It will be understood that when a 
nucleotide sequence is represented by aDNA sequence (i.e., A, T, G, C), this also 
includes an RNA sequence (i.e., A, U, G, C) in which "U" replaces "T." 

Conventional notation is used herein to describe polynucleotide 
sequences: the left-hand end of a single-stranded polynucleotide sequence is the 5' 
15 end; the left-hand direction of a double^stranded polynucleotide sequence is referred 
to as the 5'-direction. 

A first defined nucleic acid sequence is said to be "immediately 
adjacent to" a second defined nucleic acid sequence when, for example, the last 
nucleotide of the first nucleic acid sequence is chemically bonded to the first 
20 nucleotide of the second nucleic acid sequence through a phosphodiester bond. 
Conversely, a first defined nucleic acid sequence is also said to be "immediately 
adjacent to" a second defined nucleic acid sequence when, for example, the first 
nucleotide of the first nucleic acid sequence is chemically bonded to the last 
nucleotide of the second nucleic acid sequence through a phosphodiester bond. 
25 A first defined polypeptide sequence is said to be "immediately 

adjacent to" a second defined polypeptide sequence when, for example, the last amino 
acid of the first polypeptide sequence is chemically bonded to the first amino acid of 
the second polypeptide sequence through a peptide bond. Conversely, a first defined 
polypeptide sequence is said to be "immediately adjacent to" a second defined 
30 polypeptide sequence when, for example, the first amino acid of the first polypeptide 
sequence is chemically bonded to the last amino acid of the second polypeptide 
seqiaence through a peptide bond. 

The direction of 5 f to 3' addition of nucleotides to nascent RNA 
transcripts is referred to as the transcription direction. The DNA strand having the 
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same sequence as an mRNA is referred to as the "coding strand"; sequences on the 
DNA strand which are located 5' to a reference point on the DNA are referred to as 
"upstream sequences"; sequences on the DNA strand which are 3' to a reference point 
on the DNA are referred to as "downstream sequences." 
5 Unless otherwise specified, a "nucleotide sequence encoding an amino 

acid sequence" includes all nucleotide sequences that are degenerate versions of each 
other and that encode the same amino acid sequence. Nucleotide sequences that 
encode proteins and RNA may include introns. 

"Homologous" as used herein, refers to nucleotide sequence similarity 

10 between two regions of the same nucleic acid strand or between regions of two 

different nucleic acid strands. When a nucleotide residue position in both regions is 
occupied by the same nucleotide residue, then the regions are homologous at that 
position. A first region is homologous to a second region if at least one nucleotide 
residue position of each region is occupied by the same residue. Homology between 

15 two regions is expressed in terms of the proportion of nucleotide residue positions of 
the two regions that are occupied by the same nucleotide residue. By way of 
example, a region having the nucleotide sequence S'-ATTGCC-S' and a region having 
the nucleotide sequence 5-TATGGC-3' share 50% homology. Preferably, the first 
region comprises a first portion and the second region comprises a second portion, 

20 whereby, atleast about 50%, and preferably at least about 75%, at least about 90%, or 
at least about 95% of the nucleotide residue positionss of each of the portions are 
occupied by the same nucleotide residue. More preferably, all nucleotide residue 
positions of each of the portions are occupied by the same nucleotide residue. 

As used herein, "homology" is used synonymously with "identity." 

25 The determination of percent identity between two nucleotide or amino acid 

sequences can be accomplished using a mathematical algorithm. For example, a 
mathematical algorithm useful for comparing two sequences is the algorithm of 
Karlin and Altschul (1990, Proc. Natl. Acad. Sci. USA 87:2264-2268), modified as in 
Karlin and Altschul (1993, Proc. Natl. Acad. Sci. USA 90:5873-5877). This 

30 algorithm is incorporated into the NBLAST and XBLAST programs of Altschul et aL 
(1990, J. Mol. Biol. 215:403-410), and can be accessed, for example, at the BLAST 
site of the National Center for Biotechnology Information (NCBI) world wide web 
site at the National Library of Medicine (NLM) at the National Institutes of Health 
(NIH). BLAST nucleotide searches can be performed with the NBLAST program 
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(designated "blastn" at the NCBI web site), using the following parameters: gap 
penalty = 5; gap extension penalty = 2; mismatch penalty = 3; match reward = 1; 
expectation value 10.0; and word size = 11 to obtain nucleotide sequences 
homologous to a nucleic acid described herein. BLAST protein searches can be 
5 performed with the XBLAST program (designated "blastn" at the NCBI web site) or 
the NCBI "blastp" program, using the following parameters: expectation value 10.0, 
BLOSUM62 scoring matrix to obtain amino acid sequences homologous to a protein 
molecule described herein. 

To obtain gapped alignments for comparison purposes, Gapped 

10 BLAST can be utilized as described in Altschul et al. (1997, Nucleic Acids Res. 
25:3389-3402). Alternatively, PSI-Blast or PHI-Blast can be used to perform an 
iterated search which detects distant relationships between molecules (id.) and 
relationships between molecules which share a common pattern. When utilizing 
BLAST, Gapped BLAST, PSI-Blast, and PHI-Blast programs, the default parameters 

15 of the respective programs (e.g., XBLAST and NBLAST) can be used as available on 
the website of the National Center for Biotechnology Information of the National 
Library of Medicine at the National Institutes of Health. 

The percent identity between two sequences can be determined using 
techniques similar to those described above, with or without allowing gaps. In 

20 calculating percent identity, typically exact matches are counted. 

"Polypeptide" refers to a polymer composed of amino acid residues, 
related naturally occurring structural variants, and synthetic non-naturally occurring 
analogs thereof linked via peptide bonds, related naturally occurring structural 
variants, and synthetic non-naturally occurring analogs thereof. Synthetic 

25 polypeptides can be synthesized, for example, using an automated polypeptide 
synthesizer. 

As used herein, amino acids are represented by the full name thereof, 
by the three letter code corresponding thereto, or by the one-letter code corresponding 
thereto, as indicated in the following table: 
30 Full Name Three-Letter Code One-Letter Code 

Aspartic Acid Asp D 

Glutamic Acid Glu E 

Lysine Lys K 

Arginine Arg R 
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Histidine 


His 


H 


Tyrosine 


Tyr 


Y 


Cysteine 


Cys 


C 


Asparagine 


Asn 


N 


Glutamine 


Gin 


Q 


Serine 


Ser 


s 


Threonine 


Thr 


T 


Glycine 


Gly 


G 


Alanine 


Ala . 


A 


Valine 


Val 


V 


Leucine 


Leu 


L 


Isoleucine 


De 


I 


Methionine 


Met 


M 


Proline 


Pro 


P 


Phenylalanine 


Phe 


F 


Tryptophan 


Trp 


W 



The term "protein" typically refers to large polypeptides. 

The term "peptide" typically refers to short polypeptides. 
20 Conventional notation is used herein to portray polypeptide sequences: 

the left-hand end of a polypeptide sequence is the amino-terminus; the right-hand end 
of a polypeptide sequence is the carboxyl-terminus. 

A "therapeutic protein" as the term is used herein refers to any protein 
that is useful to treat a disease state or to improve the overall health of a living 
25 organism. A therapeutic protein may effect such changes in a living organism when 
administered alone, or when used to improve the therapeutic capacity of another 
substance. 

A "reagent protein" as the term is used herein refers to any protein that 
is useful in food biochemistry, bioremediation, production of small molecule 
30 therapeutics, and even in the production of therapeutic proteins. Typically, reagent 
proteins are enzymes capable of catalyzing a reaction to produce a product useful in 
any of the aforementioned areas. 

A "vector" is a composition of matter which comprises an isolated 
nucleic acid and which can be used to deliver the isolated nucleic acid to the interior 
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of a cell. Numerous vectors are known in the art including, but not limited to, linear 
polynucleotides, polynucleotides associated with ionic or amphiphilic compounds, 
plasmids, and viruses. Thus, the term "vector" includes an autonomously replicating 
plasmid or a virus. The term should also be construed to include non-plasmid and 
5 non- viral compounds which facilitate transfer of nucleic acid into cells, such as, for 
example, polylysine compounds, liposomes, and the like. Examples of viral vectors 
include, but are not limited to, adenoviral vectors, adeno-associated virus vectors, 
retroviral vectors, and the like. 

"Expression vector" refers to a vector comprising a recombinant 
10 polynucleotide comprising expression control sequences operatively linked to a 

nucleotide sequence to be expressed. An expression vector comprises sufficient cis- 
acting elements for expression; other elements for expression can be supplied by the 
host cell or in an in vitro expression system. Expression vectors include all those 
known in the art, such as cosmids, plasmids (e.g., naked or contained in liposomes) 
15 and viruses that incorporate the recombinant polynucleotide. 

A "multiple cloning site" as the term is used herein is a region of a 
nucleic acid vector that contains more than one sequence of nucleotides that is 
recognized by at least one restriction enzyme. 

An "antibiotic resistance marker" as the term is used herein refers to a 
20 sequence of nucleotides that encodes a protein which, when expressed in a living cell, 
confers to that cell the ability to live and grow in the presence of an antibiotic. 

The term "saccharide" refers in general to any carbohydrate, a 
chemical entity with the most basic structure of (CH 2 0) n . Saccharides vary in 
complexity, and may also include nucleic acid, amino acid, or virtually any other 
25 chemical moiety existing in biological systems. 

"Monosaccharide" refers to a single unit of carbohydrate of a defined 

identity. 

"Oligosaccharide" refers to a molecule consisting of several units of 
carbohydrates of defined identity. Typically, saccharide sequences between 2-20 
30 units may be referred to as oligosaccharides. 

"Polysaccharide" refers to a molecule consisting of many units of 
carbohydrates of defined identity. However, any saccharide of two or more units may 
correctly be considered a polysaccharide. 
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A "party" as the term is used herein refers to an individual or an entity 
involved in a transaction related to a method, a vector, or a protein of the present 
invention. For example, an individual who provides a vector to a business entity is 
considered to be a "party" in the context of the present invention. Further, the 
5 business entity is also considered to be a "party" in the context of the present 
invention. 

A "recipient" as the term is used herein refers to a specific party who 
receives a vector or a protein of the present invention.. For example, if an individual 
gives a business entity a vector of the invention, the business entity is considered to be 

10 a "recipient" in the context of the present invention. By way of another example, an 
individual within an organization may provide a second individual within the same 
organization with a vector or a protein of the present invention. The second 
individual, who is in receipt of a vector or a protein of the invention, is considered to 
be a "recipient" in terms of the present invention. Itshould be noted that a recipient 

15 may be a customer, but that not all customers are recipients. 

A "customer," as the term is used herein, refers to an intended recipient 
of a specific item in a formal transaction. A customer is also a recipient, but is 
distinct from a recipient in that a customer is recipient who is an endpoint for a 
transaction, whereas a recipient may be an intermediate in a larger transaction. For 

20 example, customers of the present invention include, but are not limited to, an entity 
responsible for the creation of an expression vector that contains the cDNA encoding 
a protein of interest, if that entity will use the protein produced. A customer is also an 
entity responsible for expression of a protein from a vector that contains the cDNA 
encoding a protein of interest, if that entity will use the protein produced. A customer 

25 also may be an entity that purchases a protein expressed from a vector of the present 
invention, for the purpose of using the protein. Further, the entity that creates an 
expression vector of the present invention may be a customer if that entity uses the 
protein produced by the vector. 

A "protein production facility" as the term is used herein is any 

30 location that has the ability to express a protein encoded within a nucleic acid vector. 

As the term is used herein, "in-house" refers to dealings within a single 
organization. In this context, an organization may be a single company, two or more 
jointly cooperating laboratories, or a corporation and its subsidiaries, collectively. 
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"Offsite," as the term is used herein, refers to dealings that extend 
beyond the "in-house" context.discussed above. For example, the transfer of a vector 
of the invention from a first organization to a second organization is a transfer of that 
vector "offsite." 

5 

I. Vectors 

The present invention includes an isolated nucleic acid encoding a 
protein operably linked to a nucleic acid comprising a promoter/regulatory sequence 

10 such that the nucleic acid is capable of directing expression of the protein encoded by 
the nucleic acid. Thus, the invention encompasses expression vectors and methods 
for the expression of proteins based on exogenous DNA introduced into cells with 
concomitant expression of the exogenous DNA in the cells such as those described, 
for example, in Sambrook et al. (1989, supra), and Ausubel et aL (1997, supra). 

15 An expression vector of the present invention is based on the pcWori+ 

vector (Muchmore et al., 1987, Meth. Enzymol. 177:44-73). However, the pcWori+ 
vector by itself is not adequate for the production of protein reagents and therapeutic 
proteins according to the present invention. The pcWori+ vector contains an 
ampicillin resistance marker. Certain regulatory agencies require that the production 

20 of proteins for therapeutic use cannot be carried out using recombinant vectors 

containing ampicillin resistance genes. Therefore, an expression vector of the present 
invention features an antibiotic resistance marker approved by the U.S. Food and 
Drug Administration (EDA) for use in the production of protein reagents and 
therapeutic proteins. Such antibiotic resistance markers include, but are not limited 

25 to, kanamycin, tetracycline, and chloramphenicol. 

In the invention, the ampicillin resistance marker normally present in 
the pcWori+ vector is disabled as follows. Briefly, the ampicillin resistance marker in 
the pcWori+ vector is disrupted in order to produce, in part, a vector of the present 
invention. PCR primers designed to create Pvul and Seal restriction enzyme 

30 cleavage sites on either end of a kanamycin resistance gene are used, and the resultant 
PCR product is digested with Pvul and Seal restriction enzymes. The ampicillin 
resistance gene in a pcWori+ vector is also cut with Pvul and Seal restriction 
enzymes. Subsequently, the kanamycin resistance gene is ligated into the pcWori+ 
vector that was cleaved within the ampicillin resistance gene. 
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Verification of successful disruption of the ampicillin resistance gene 
and successful insertion of the kanamycin resistance gene is observed by transforming 
E. coli cells, for example, with the ligation mixture. Growth of the transformed cells 
on kanamycin-containing agar plates confirms the successful insertion of the 
5 kanamycin resistance marker, while lack of growth on ampicillin containing plates 
confirms the successful disruption of the ampicillin resistance gene. Other methods 
of disruption or deletion of the ampicillin resistance gene will be known to one of 
skill in the art. Similarly, other methods of inserting the kanamycin resistance gene, 
or any other antibiotic resistance gene useful in the present invention, and methods of 
10 confirming the insertion and/or deletion of genes will also be known to one of skill in 
the art. 

Another feature of a vector of the present invention is a versatile and 
highly-functional multiple cloning site. As described in, for example, in Sambrook et 
al. (1989, supra), a "multiple cloning site" is a nucleic acid having a sequence 

15 encoding more than one restriction enzyme recognition site. The practical purpose of 
a multiple cloning site is to allow the ligation (i.e., "insertion") of an exogenous 
polynucleotide into the multiple cloning site, wherein the exogenous polynucleotide 
may have different restriction enzyme recognition sequences at its 5' and 3' ends. 
That is, the multiple cloning site allows flexibility with respect to the identity of the 5' 

20 and 3' ends on an exogenous polynucleotide, thus facilitating the cloning of such a 
polynucleotide into the multiple cloning site. 

A multiple cloning site is most often found, and is most useful, in a 
nucleic acid vector such as a vector of the present invention. As will be known to the 
skilled artisan, a multiple cloning site may be located adjacent to other functional 

25 elements in a vector, such as a promoter. A multiple cloning site may also be 

designed such that insertion of an exogenous polynucleotide into the multiple cloning 
site results in the exogenous polynucleotide being expressed in frame with the 
adjacent elements to create a fusion protein of the protein encoded by the exogenous 
polynucleotide and the protein encoded by the adjacent element. 

30 Accordingly, a vector of the present invention contains at least one 

multiple cloning site. The creation of a functional multiple cloning site in a vector of 
the present invention is described in greater detail elsewhere herein. Briefly, a 
multiple cloning site may be designed and synthesized de novo, or it may be isolated 
from another pre-existing vector. PCR methods are used to create multiple cloning 
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site polynucleotides having specific restriction enzyme recognition sites on either end 
(5' and 3') of the multiple cloning site polynucleotide. A multiple cloning site 
polynucleotide is then inserted into a pcWori+ vector of the present invention by 
means of specific restriction enzyme recognition sites corresponding to those on 
5 either end of the multiple cloning site polynucleotide. As will be understood by one 
skilled in the art, various molecular biological techniques are available to insert, 
delete, and/or modify a multiple cloning site in a vector of the present invention in 
order to create a more functional and flexible multiple cloning site useful in 
connection with the present invention. 

10 Another feature of a vector of the present invention is the option of an 

affinity tag coding sequence located in the multiple cloning site. An affinity tag 
coding sequence may be inserted into the multiple cloning site adjacent to, upstream 
from, or downstream from a target protein coding sequence. As will be understood by 
one of skill in the art, an affinity tag will typically be inserted into the multiple 

15 cloning site in frame with the target protein. One of skill in the art will also 

understand that an affinity tag coding sequence can be used to produce a recombinant 
fusion protein by concomitantly expressing the affinity tag and target protein. The 
expressed fusion protein can then be isolated, purified, or identified by means of the 
affinity tag. An affinity tag is especially important when expressing proteins that are 

20 reagents and less important when expressing therapeutic proteins due to restrictions 
imposed by regulatory agencies. 

Affinity tags useful in the present invention include, but are not limited 
to, a maltose binding protein, a histidine tag, a Factor IX tag, a glutathione-S- 
transferase tag, a FLAG-tag, and a starch binding domain tag. Other tags are well 

25 known in the art, and the use of such tags in the present invention would be readily 
understood by the skilled artisan. 

Any single vector of the present invention may have more than one 
feature described herein. By way of a non-limiting example, a vector of the present 
invention may have a disrupted ampicillin resistance gene, a functional kanamycin 

30 resistance gene, and a modified, multi-functional multiple cloning site. An example 
of one such vector of the present invention is pCWinl, the sequence of which is set 
forth in SEQ ID NO: 1. A pCWinl vector of the present invention has, for example, 
two BamHI restriction enzyme recognition sites, one of which is located within the 
multiple cloning site. Another example of a vector of the present invention is 
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pCWin2, the sequence of which is set forth in SEQ ID NO:2. A pCWin2 vector of 
the present invention has, for example, only one BamHI restriction enzyme 
recognition site which is located within the multiple cloning site. 

A further example of a vector of the present invention is pCWin2- 
5 MBP, the sequence of which is set forth in SEQ ID NO:3. A pCWin2-MBP vector of 
the invention has, for example, one BamHI restriction enzyme recognition site located 
within the multiple cloning site and additionally has an E. coli maffi maltose binding 
protein coding sequence inserted into the multiple cloning site in between the Ndel 
and BamHI restriction enzyme recognition sites. The Ndel sequence in the multiple 

10 cloning site contains an ATG start codon. The pCWin2-MBP vector is therefore 

useful, for example, for expression of a fusion protein comprised of a maltose binding 
protein and a desired protein. This is achieved by inserting a polynucleotide encoding 
the desired protein into the multiple cloning site in frame with the maltose binding 
protein and expressing the entire open reading frame encoded in the multiple cloning 

15 site. 

Yet another example of a vector of the present invention is pCWin2- 
MBP-SBD39 (pMS 39 ), the sequence of which is set forth in SEQ ID NO: 10. A 
pCWin2-MBP-SBD 39 (pMS 39 ) vector of the invention has, for example, one BamHI 
restriction enzyme recognition site located within the multiple cloning site, and one 

20 EcoRI restriction enzyme recognition site located within the multiple cloning site, and 
additionally has an E. coli malE maltose binding protein coding sequence inserted 
into the multiple cloning site in between the Ndel and SacI restriction enzyme 
recognition sites. The pCWin2-MBP-SBD 39 (pMS 39 ) vector also has a starch-binding 
domain (SBD) inserted between the EcoRI and BamHI restriction sites. The Ndel 

25 sequence in the multiple cloning site contains an ATG start codon. The pCWin2- 
MBP-SBD 39 (pMS 39 ) vector is therefore useful, for example, for expression of a 
fusion protein comprised of a maltose binding protein, a starch binding domain, and a 
desired protein. This is achieved by inserting a polynucleotide encoding the desired 
protein into the multiple cloning site in frame with the maltose binding protein and 

30 starch binding domain, and expressing the entire open reading frame encoded in the 
multiple cloning site. 

Still another example of a vector of the present invention is pCWin2- 
MBP-MCS-SBD 39 (pMXS 39 ), the sequence of which is set forth in SEQ ID NO:l 1. 
As compared to, the pMXS 39 vector expresses, in one aspect of the invention, a fusion 
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protein with the structure "MBP — desired protein — SBD," whereas the pMS 39 vector 
expresses, in another aspect of the invention, a "MBP — SBD — desired protein" fusion 
protein, Accordingly, a pCWin2-MBP-SBD 39 (pMXS 39 ) vector of the invention has, 
for example, one Xhol restriction enzyme recognition site located within the multiple 
5 cloning site, and one Sail restriction enzyme recognition site located within the 
multiple cloning site, and additionally has an E. coli malE maltose binding protein 
coding sequence inserted into the multiple cloning site in between the Ndel and SacI 
restriction enzyme recognition sites. The pCWin2-MBP-SBD 39 (pMXS 39 ) vector also 
has a starch-binding domain (SBD) inserted between the Xhol and Sail restriction 

10 sites. The Ndel sequence in the multiple cloning site contains an ATG start codon. 
The pCWin2-MBP-SBD 39 (pMXS 39 ) vector is therefore useful, for example, for 
expression of a fusion protein comprised of a maltose binding protein, a starch 
binding domain, and a desired protein. This is achieved by inserting a polynucleotide 
encoding the desired protein into the multiple cloning site in frame with the maltose 

15 binding protein and starch binding domain, and expressing the entire open reading 
frame encoded in the multiple cloning site. 

A vector of the present invention, as described above, is useful for the 
production of a therapeutic protein. A polynucleotide sequence encoding a 
therapeutic protein may be inserted into the multiple cloning site using any technique 

20 known to the skilled artisan. For example, a polynucleotide sequence encoding a 

therapeutic protein may be modified to contain specific restriction enzyme recognition 
sites at the 5' and 3' ends of the polynucleotide. Such restriction enzyme recognition 
sites will correspond to recognition sites located within the multiple cloning site of a 
vector of the present invention, facilitating the insertion (by ligation) of the 

25 therapeutic protein-encoding sequence into the multiple cloning site, and when 
expressed, producing a therapeutic protein. 

Therapeutic proteins useful in the present invention are numerous and 
are well-known in the art, and are therefore not listed here. By way of a non-limiting 
example, such therapeutic proteins include erythropoietin, human growth hormone, 

30 granulocyte colony stimulating factor, interferons alpha, -beta, and -gamma, Factor 
DC, follicle stimulating hormone, interleukin-2, erythropoietin, anti-TNF-alpha, and a 
lysosomal hydrolase. Lysosomal hydrolases useful in the present invention include, 
but are not limited to, beta-glucosidase, alpha-galactosidase-A, beta-hexosaminidase, 
beta-galactosidase, alpha-galactosidase, alpha-mannosidase, beta-mannosidase, alpha- 
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L-fucosidase, beta-glucuronidase, alpha-glucosidase, alpha-N- 
acetylgalactosaminidase, and acid phosphatase. It will be understood that any mutant 
or variant of a therapeutic protein may be expressed using vectors of the present 
invention. 

5 The present invention also features a vector useful for the production 

of a non-therapeutic protein, referred to herein as reagent proteins. As will be 
understood by the skilled artisan, a reagent protein is one which does not currently 
have a therapeutic application. Such proteins include, but are not limited to, enzyme 
reagents, food enzymes, nutritional supplements, and non-active additives. Methods 

10 of expressing reagent proteins using vectors of the invention will be understood by the 
skilled artisan to be conducted in the same manner as the above-described methods of 
expressing therapeutic proteins using vectors of the present invention. 

Another feature of a vector of the present invention is the option of a 
protease cleavage site coding sequence located in the multiple cloning site. A 

15 protease cleavage site coding sequence may be inserted into the multiple cloning site 
adjacent to, upstream from, or downstream from a target protein coding sequence. As 
will be understood by one of skill in the art, a protease cleavage site will typically be 
inserted into the multiple cloning site in frame with the target protein. One of skill in 
the art will also understand that a protease cleavage site coding sequence can be used 

20 to produce a recombinant fusion protein by concomitantly expressing the protease 
cleavage site sequence and target protein. The expressed fusion protein can then be 
isolated, purified, or identified by means of the protease cleavage site. 

In an embodiment of the invention, a vector contains a coding 
sequence for a protease cleavage site which is located C-terminal to the nucleic acid 

25 sequence encoding an MBP. In one aspect, a vector is pCWin2-MBP-SBD 39 (pMS 39 ). 
In another aspect, a vector is pCWin2-MBP-MCS-SBD 39 (pMXS 39 ). 

A fusion protein containing a preselected protease cleavage site, as will 
be understood by one of skill in the art, is useful for the removal of amino acid 
sequence that is extraneous or non-essential to the expressed protein of interest. For 

30 example, a target protein may be expressed using a vector of the present invention as 
a fusion with an affinity tag for the purpose of purification of the target protein, but 
the affinity tag may not be desirable once the protein is sufficiently purified. The 
insertion of a specific protease cleavage site between the target protein and the 
affinity tag is useful for the cleavage of the affinity tag from the target protein. 
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Protease cleavage sites useful in the present invention include, but are not limited to, 
an enterokinase cleavage site, a Factor Xa cleavage site, a thrombin cleavage site, and 
a TEV protease cleavage site. The skilled artisan will understand the characteristics 
and uses of a protease cleavage useful in the present invention. 
5 The present invention also features a recombinant bacterial host cell 

comprising , inter alia, a nucleic acid vector as described elsewhere herein. In one 
aspect, the recombinant cell is transformed with a vector of the present invention. The 
transformed vector need not be integrated into the cell genome nor does it need to be 
expressed in the cell. However, the transformed vector will be capable of being 

10 expressed in the cell. In one aspect of the invention, E. coli is used for transformation 
of a vector of the present invention and expression of protein therefrom. In another 
aspect of the invention, a K-12 strain of E. coli is useful for expression of protein 
from a vector of the present invention. Strains of E. coli useful in the present 
invention include, but are not limited to, JM83, JM101, JM103, JM109, W3110, 

15 chi!776,andJA221. 

It will be understood that a host cell useful in the present invention will 
be capable of growth and culture on a small scale, medium scale, or a large scale. For 
example, a host cell of the invention is useful for testing the expression of a protein 
from a vector of the invention equally as much as it is useful for large scale 

20 production of a reagent or therapeutic protein product. Techniques useful in culturing 
host cells and expressing protein from a vector contained therein are well known in 
the art and will therefore not be listed herein. 

A host cell of the present invention may be transformed with a vector 
of the present invention to produce a transformed host cell of the invention. 

25 Transformation, as known to the skilled artisan, includes the process of inserting a 
nucleic acid vector into a host cell, such that the host cell containing the nucleic acid 
vector remains viable. Such transformation of nucleic acid into a bacterial cell is 
useful for purposes including, but not limited to, creation of a stably-transformed host 
cell, making a biological deposit, propagating the vector-containing host cell, 

30 propagating the vector-containing host cell for the production and isolation of 
additional vector, expression of target protein encoded by vector, and the like. 

Methods of transforming a vector are numerous and well-known in the 
art, and will therefore not be listed here. By way of a non-limiting example, a 
competent bacterial cell of the invention may be transformed by a vector of the 
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invention using electroporation. Methods of making bacterial cells "competent" are 
well-known in the art, and typically involve preparation of the bacterial cells so that 
the cells take up exogenous DNA. Similarly, methods of electroporation are known 
in the art, and detailed descriptions of such methods may be found, for example, in 
5 Sambrook et aL (1989, supra). The transformation of a competent cell with vector v 
DNA may be also accomplished using chemical-based methods. One example of a 
well-known chemical-based method of bacterial transformation is described by Inoue, 
et al. (1990, Gene 96:23-28). Other methods of transformation will be known to the 
skilled artisan. 

10 In one embodiment of the present invention, a Cst-04Kan5 plasmid is 

transfomed into E. coli JM109 cells using 20 |Ltl JM109 competent cells in 0.34 yd 
1.42 M beta mercaptoethanol, incubated on ice for 10 minutes, at which time JLjutl (100 
ng) Cst-04-Kan5 plasmid is added to the transformation mixture. The cell/DNA 
mixture is incubated ice for 30 minutes, then heat shocked at 42 °C for 45 seconds. 

15 The reaction is then incubated on ice for 2 minutes, at which time 80 yd SOC media is 
added. The reaction mixture is then shaken at 37°C for 1 hour, and subsequently, 
plated on LB Kan r agar plates. Identification and confirmation of the Cst-04-Kan5 
plasmid DNA is carried out using a restriction enzyme digestion of plasmid DNA 
isolated from positive transformants, using Ndel, Sail, PstI restriction enzymes. 

20 A transformed host cell of the present invention may be used to 

express a protein. In an embodiment of the invention, a transformed host cell contains 
a vector of the invention, which contains therein a nucleic acid sequence encoding an 
exogenous protein. The protein is expressed using any expression method known in 
the art (for example, IPTG). The expressed protein may be contained within the host 

25 cell, or it may be secreted from the host cell into the growth medium. 

Methods for isolating an expressed protein are well-known in the art, 
and the skilled artisan will know how to determine the best method for isolation of an 
expressed protein based on the characteristics of any given host cell expression 
system. By way of a non-limiting example, an expressed protein that is secreted from 

30 a host cell may be isolated from the growth medium. Isolation of a protein from a 

growth medium may include removal of bacterial cells and cellular debris. By way of 
another non-limiting example, an expressed protein that is contained within a host cell 
may be isolated from the host cell. Isolation of such an "intracellular" expressed 
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protein may include disruption of the host cell and removal of cellular debris from the 
resultant mixture. These methods are not intended to be exclusive representations of 
the present invention, but rather, are merely for the purposes of illustration of various 
applications of the present invention. 
5 Purification of a protein expressed in accordance with the present 

invention may be effected by any means known in the art. The skilled artisan will 
know how to determine the best method for the purification of a protein expressed in 
accordance with the present invention. A purification method will be chosen by the 
skilled artisan based on factors such as, but not limited to, the expression host, the 

10 contents of the crude extract of the protein, the size of the protein, the properties of 
the protein, the desired end product of the protein purification process, and the 
subsequent use of the end product of the protein purification process. 

In an embodiment of the invention, isolation or purification of a 
protein expressed in accordance with the present invention may not be desired. In an 

15 aspect of the present invention, an expressed protein may be stored or transported 

inside the bacterial host cell in which the protein was expressed. In another aspect of 
the invention, an expressed protein may be used in a crude lysate form, which is 
produced by lysis of a host cell in which the protein was expressed. In yet another 
embodiment of the invention, an expressed protein may be partially isolated or 

20 partially purified according to any of the methods set forth or described herein. The 
skilled artisan will know when it is not desirable to isolate or purify a protein of the 
invention, and will be familiar with the techniques available for the use and 
preparation of such proteins. 

25 II. Methods of providing a protein to a customer 

The present invention features a method of providing a protein to a 
customer. In an embodiment of the invention, a nucleic acid encoding a protein is 
cloned into an expression vector. The encoded protein is expressed from the 
expression vector, and the resulting protein product is provided to a customer. 

30 In an embodiment of the invention, a protein is expressed from an 

expression vector in vitro. Techniques for in vitro protein expression are known in 
the art, and are exemplified by the methods of Melton and colleagues (Krieg et aL, 
1987, 
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Meth. Enzymol. 155, 397-415; Yisraeli et al., 1989, Meth. EnzymoL 180, 42-50). A 
protein produced using an in vitro expression method of the present invention is 
provided to a customer. 

In another embodiment of the invention, a protein is expressed from an 
5 expression vector in vivo. Numerous techniques for expression of a protein from an 
expression vector in vivo are described in detail elsewhere herein, and are also well- 
known in the art. Such techniques include, but are not limited to, expression of a 
protein from a vector in a bacterial host cell. A protein produced using an in vivo 
expression method of the present invention is provided to a customer. 

10 In one aspect of the invention, a protein is expressed from a pcWINl 

expression vector, as set forth in SEQ ID NO:l. In another aspect of the invention, a 
protein is expressed from a pcWIN2 expression vector, as set forth in SEQ ID NO:2. 
In yet another aspect of the invention, a protein is expressed from a pcWIN2/MBP 
expression vector, as set forth in SEQ ID NO:3. In still another aspect of the 

15 invention, a protein is expressed from a pCWin2-MBP-SBD 39 (pMS 3 9) expression 

vector, as set forth in SEQ ID NO: 10. In yet another aspect of the invention, a protein 
is expressed from a pCWin2-MBP-MCS-SBD 39 (pMXS 39 ) expression vector, as set 
forth in SEQ ID NOrll. As will be understood by one of skill in the art, a pcWIN 
vector of the present invention is useful in any of the expression methods set forth 

20 herein for the production of a target protein that may be provided to a customer. 

Methods of the present invention for in vivo expression of a protein in 
a bacterial cell comprise transformation of the bacterial cell with an expression vector 
comprising the protein of interest. Methods of transforming a bacterial cell with a 
vector are described in detail elsewhere herein, and would be understood by one of 

25 ordinary skill in the art. It will be appreciated that methods of bacterial cell 

transformation other than those explicitly disclosed herein are useful in methods of 
the present invention, and therefore, are within the scope of the present invention. 

Vectors featured in methods of the present invention are described in 
detail elsewhere herein. In an embodiment of the invention, a method of providing a 

30 protein to a customer comprises expressing a protein from an expression vector useful 
for production of a therapeutic protein. As described above, vectors of the invention 
useful in such methods are comprised of an antibiotic resistance marker such as 
kanamycin, tetracycline, chloramphenicol, and the like, as such antibiotics are 
particularly useful in connection with the expression of therapeutic proteins. A 
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therapeutic protein provided to a customer using this method is particularly useful to 
the customer due to the fact that kanamycin, tetracycline, chloramphenicol and like 
antibiotics are preferred by certain regulatory agencies for the production of 
therapeutic proteins. 

5 The present invention therefore features a method of providing a 

known therapeutic protein to a customer. Therapeutic proteins include, but are not 
limited to, human growth hormone, granulocyte colony stimulating factor, interferons 
alpha, -beta, and -gamma, Factor IX, follicle stimulating hormone, beta-glucosidase, 
interleukin-2, erythropoietin, alpha-gaiactosidase-A, and anti-TNF-alpha. It will be 

10 understood by the skilled artisan that any nucleic acid encoding a therapeutic protein, 
wherein the nucleic acid is capable of being cloned into and expressed from a nucleic 
acid vector of the invention, will be useful in the present invention. The ability to 
determine a therapeutic protein useful in the present invention is within the skill of the 
ordinary artisan and such a determination does not require undue experimentation. 

15 In another embodiment of the invention, a method of providing a 

protein to a customer comprises expressing a protein from an expression vector useful 
for production of a reagent protein. Vectors of the invention useful in such methods 
preferably have had the native ampicillin resistance gene disabled, altered, or deleted, 
such that the ampicillin resistance gene is no longer functional in the vector. Such 

20 vectors are comprised of any antibiotic resistance marker other than ampicillin known 
in the art to be useful in the expression of proteins. Antibiotic resistance markers 
useful in the invention include kanamycin, tetracycline, chloramphenicol, and like 
antibiotic resistance markers approved by certain regulatory agencies, as well as any 
antibiotic resistance marker not approved by certain regulatory agencies for use with 

25 therapeutic proteins. 

Therefore, the present invention also features methods of providing a 
protein to a customer, wherein the protein is a reagent protein, and therefore, need not 
be expressed from a vector containing antibiotic resistance marker accepted by a 
regulatory agency. However, a reagent protein may also be expressed from a vector 

30 containing an FDA-accepted antibiotic resistance marker. A protein produced by 
such a method of the invention may be useful for almost any purpose, including, but 
not limited to, an enzyme reagent, a food enzyme, a nutritional supplement, and a 
non-active additive. Examples of such proteins include, but are not limited to a 
glycosyltransferase and a sugar nucleotide-generating enzyme. 
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In an embodiment of the invention, a method is provided wherein a 
nucleic acid is cloned into a vector containing any antibiotic resistance marker useful 
in the expression of a protein, the protein is expressed therefrom, and the resulting 
protein product is provided to a customer. It will be understood that such a protein 
5 may be expressed in vivo or in vitro. 

Methods of the present invention also feature a vector comprising a 
highly-functional multiple cloning site. Such vectors are described in detail elsewhere 
herein. In an embodiment of the invention, a method of providing a protein to a 
customer comprises expressing a protein from an expression vector useful for 
10 production of a therapeutic protein. As described above, vectors of the invention 

useful in such methods are comprised of a highly-functional multiple cloning site, in 
addition to an antibiotic resistance marker such as kanamycin, tetracycline, 
chloramphenicol, and the like. In this embodiment, a therapeutic protein is expressed 
cloned into a vector comprising a highly-functional multiple cloning site in addition to 
15 an antibiotic resistance marker, expressed therefrom, and provided to a client. In one 
aspect of the invention, the multiple cloning site contains at least one of Ndel, 
BamHI, SacI, Hindm, Xbal, Xhol, EcoRI, Kpnl, and Sail restriction enzyme cleavage 
sites. 

Methods of the present invention also feature a vector comprising an 
20 affinity tag. Such vectors are described in detail elsewhere herein. In an embodiment 
of the invention, a method of providing a protein to a customer comprises expressing 
a protein from an expression vector useful for production of a therapeutic protein. As 
described above, vectors of the invention useful in such methods are comprised of an 
affinity tag, in addition to an antibiotic resistance marker such as kanamycin, 
25 tetracycline, chloramphenicol, and the like. In this embodiment, a therapeutic protein 
is expressed cloned into a vector comprising an affinity tag in addition to an antibiotic 
resistance marker, expressed therefrom, and provided to a client. In a preferred 
embodiment, the affinity tag is a maltose-binding protein. 

In other embodiments of the invention, a useful affinity tag may be, but 
30 is not limited to, a histidine tag, a Factor EX tag, a glutathione-S-transferase tag, 
starch-binding domain, a FLAG-tag, and the like. One of skill in the art will 
understand that any affinity tag capable of being used with a vector of the present 
invention will be useful in methods of the invention. Further, the skilled artisan will 
also appreciate that a single vector of the invention may comprise more than one 
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affinity tag, and that multiple affinity tags may be identical or may be heterogeneous 
in sequence. 

The present invention also features a method of providing a protein to 
a customer, wherein an expression vector used to express a protein has multiple 
5 characteristics as described elsewhere herein. For example, a method of the present 
invention for providing a protein to a customer comprises the cloning of a nucleic acid 
encoding the protein into an expression vector, wherein the expression vector 
comprises a kanamycin resistance marker and a highly-functional multiple cloning 
site. 

10 In an embodiment of the invention, a method of providing a protein to 

a customer comprises the cloning of a protein into the expression vector set forth in 
SEQ ID NO: 1 . The protein-SEQ ID NO: 1 construct is transformed into an E. coli 
cell, the protein is expressed therefrom, and the protein product is provided to the 
customer. In one aspect of the invention, the E. coli cell is a JM109 cell. In another 

15 aspect of the invention, the protein is a therapeutic protein. In yet another aspect of 
the protein, the protein is a reagent protein. 

In another embodiment of the invention, a method of providing a 
protein to a customer comprises the cloning of a protein into the expression vector set 
forth in SEQ ID NO:2. The protein-SEQ ID NO:2 construct is transformed into an E. 

20 coli cell, the protein is expressed therefrom, and the protein product is provided to the 
customer. In one aspect of the invention, the E. coli cell is a JM109 cell. In another 
aspect of the invention, the protein is a therapeutic protein. In yet another aspect of 
the protein, the protein is a reagent protein. 

In another embodiment of the invention, a method of providing a 

25 protein to a customer comprises the cloning of a protein into the expression vector set 
forth in SEQ ID NO:3. The protein-SEQ ID NO:3 construct is transformed into an E. 
coli cell, the protein is expressed therefrom, and the protein product is provided to the 
customer. In one aspect of the invention, the E. coli cell is a JM109 cell. In another 
aspect of the invention, the protein is a therapeutic protein. In yet another aspect of 

30 the protein, the protein is a reagent protein. 

In still another embodiment of the invention, a method of providing a 
protein to a customer comprises the cloning of a protein into the expression vector set 
forth in SEQ ID NO:10. The protein-SEQ ID NO:10 construct is transformed into an 
E. coli cell, the protein is expressed therefrom, and the protein product is provided to 
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the customer. In one aspect of the invention, the E. coli cell is a JM109 cell. In 
another aspect of the invention, the protein is a therapeutic protein. In yet another 
aspect of the protein, the protein is a reagent protein. 

In another embodiment of the invention, a method of providing a 
5 protein to a customer comprises the cloning of a protein into the expression vector set 
forth in SEQ ID NO: 1 1 . The protein-SEQ ID NO: 1 1 construct is transformed into an 
E. coli cell, the protein is expressed therefrom, and the protein product is provided to 
the customer. In one aspect of the invention, the E. coli cell is a JM109 cell. In 
another aspect of the invention, the protein is a therapeutic protein. In yet another 

10 aspect of the protein, the protein is a reagent protein. 

The present invention features a method of providing a protein to a 
customer, wherein a nucleic acid encoding a protein is cloned into an expression 
v vector of the invention by the party providing a vector to a recipient. That is, a 
nucleic acid encoding a protein is cloned into an expression vector of the invention 

15 before the vector is transferred to a recipient. In an embodiment of the invention, a 
method of providing a protein to a customer comprises providing a vector to a 
recipient, wherein the vector contains a nucleic acid encoding a protein. The recipient 
of the vector expresses the protein, and the protein is then provided to a customer. In 
one aspect of the invention, a recipient is a protein production facility. 

20 In an embodiment of the invention, a method of providing a protein to 

a customer comprises providing a vector to a recipient, wherein the vector does not 
contain a nucleic acid encoding a protein. The recipient of the vector clones a nucleic 
acid encoding a protein into the vector and expresses the protein, and the protein is 
then provided to a customer. In one aspect of the invention, a recipient is a protein 

25 production facility. In another aspect of the invention, the nucleic acid cloned into a 
vector is provided by the party providing the vector to the recipient. In yet another 
aspect of the invention, the nucleic acid cloned into a vector is provided by the 
recipient. 

The invention also features a method of providing a protein to a 
30 customer, wherein the method comprises providing a vector to a recipient, wherein 
the vector comprises a nucleic acid encoding a protein, for the purpose of expression 
of the protein encoded by the vector. In one embodiment of the invention, the 
recipient is a protein production facility. In one aspect, the protein production facility 
is in-house. By way of non-limiting examples, such recipients include an in-house 
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protein production facility and an in-house laboratory. In another aspect of the 
invention, the protein production facility is offsite. By way of non-limiting examples, 
such recipients include an offsite protein production facility, an offsite laboratory, an 
offsite biotechnology company and an offsite pharmaceutical company. 
5 A protein produced using a method of the present invention may be 

provided to a customer by the recipient of the vector, wherein the recipient is 
responsible for expressing the protein from the vector provided by another party. 
Alternatively, a protein produced using a method of the present invention may be 
provided to a customer by the original party that provided the vector to the recipient, 

10 wherein the recipient expresses the protein and provides the resulting protein product 
to the original party so that the original party may provide the protein to a customer. 

A protein produced using a method of the present invention may be 
provided to a customer in the form of a purified protein, a partially purified protein, 
an isolated protein, a partially isolated protein, a bacterial cell lysate, cell paste or 

15 purified inclusion bodies. It will be understood that a protein produced using a 

method of the present invention may be provided to a customer in any form known in 
the art to be useful for the storage, transfer, or processing of a recombinant protein. 

The present invention also features a method of providing a protein to 
a customer, wherein at least one glycosyl moiety is added to the protein before 

20 providing the protein to the customer. A glycosyl moiety may be added to a protein 
using any method known in the art. Additionally, a glycosyl moiety may be added to 
a protein of the invention using any one of the methods or reagents taught by DeFrees 
et al. in PCT Application WO 03/031464, which is incorporated herein by reference 
in its entirety. The skilled artisan will understand, based on the disclosure herein, that 

25 any of the methods known in the art or set forth herein are useful for glycosylating a 
protein of the present invention prior to providing the protein to a customer. 

Thus, in an embodiment of the present invention, a method of 
providing a protein to a customer comprises providing a vector comprising a nucleic 
acid encoding a protein to a recipient, wherein the recipient expresses the protein, and 

30 further wherein the protein is modified with at least one glycosyl moiety before the 
protein is provided to a customer. In one aspect of the invention, at least one glycosyl 
moiety is added to the protein by the recipient before providing the protein to a 
customer. In another aspect of the invention, at least one glycosyl moiety is added to 
the protein by the recipient before providing the protein to the original supplier of the 
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vector, wherein the original supplier of the vector subsequently provides the protein to 
a customer. In yet another aspect of the invention, the recipient provides the 
expressed protein to the original supplier of the vector, wherein the original supplier 
of the vector adds at least one glycosyl moiety to the protein before providing the 
5 protein to a customer. 

In another embodiment of the invention, a method of providing a 
protein to a customer comprises providing a vector to a recipient, wherein the 
recipient clones a nucleic acid encoding a protein into the vector, expresses the 
protein, and further wherein the protein is modified with at least one glycosyl moiety 

10 before the protein is provided to a customer. In one aspect of the invention, at least 
one glycosyl moiety is added to the protein by the recipient before providing the 
protein to a customer. In another aspect of the invention, at least one glycosyl moiety 
is added to the protein by the recipient before providing the protein to the original 
supplier of the vector, wherein the original supplier of the vector to the recipient 

15 provides the protein to a customer. In yet another aspect of the invention, the 

recipient provides the expressed protein to the original supplier of the vector, wherein 
the original supplier of the vector adds at least one glycosyl moiety to the protein 
before providing the protein to a customer. 

20 EXPERIMENTAL EXAMPLES 

The invention is now described with reference to the following 
examples. These examples are provided for the purpose of illustration only and the 
invention should in no way be construed as being limited to these examples but rather 
should be construed to encompass any and all variations which become evident as a 

25 result of the teaching provided herein. 

Example 1: Modification of pCWori+ Amp r expression vector by disrupting the 
Amp r gene and adding the kanamvcin resistance gene 

30 The pCWori+ Amp r vector contains an ampicillin resistance marker, as 

well as the genes encoding N. Meningitidis CMP-NAN synthetase (CNS) and 
Campylobacter Jejuni oc2,3 Sialyl Transferase (CstI), referred to as Cst-04 (Cst-04 
was provided by Warren Wakarchuck, National Research Council, Canada). This 
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example describes the complete process by which the Cst-04 (pCWori+ amp r -CNS- 
Cstl) plasmid was interrupted at the Pvul and Seal sites of ampicillin gene by the 
insertion of the kanamycin resistance gene. 

A kanamycin resistance gene was isolated from pGEX-Kt-ext Kan r 
5 using PCR to generate cDNA with modified restriction sites at 5' (Pvul- 

ATTCCAATTCGATCGGGGGGGGGGGGAAA) (SEQ ID NO:4) and 3' (Scal- 
ATTCCAAGTACjTACTTTAGAAAAACTCATCG) (SEQ ID NO:5) ends. The PCR 
product was then subcloned into a Cst04 (pCWori+ amp r -CNS-CstI) vector in TGI 
cells. A colony positive for the recombinant vector (Cst-04Kan5) was identified, and 

10 ; the Cst-04Kan5 plasmid was isolated, then transformed into JM109 cells. 

A PCR reaction was conducted containing 1 ng pGEXKan r template, 1 
jxg (1 ill) kan-Scal/Pvul primer, 77 jllI H 2 0, 8 \xl dNTP mixture, 10 |*1 10X buffer, and 
1 \xl Vent polymerase. The reaction parameters included a 5 minute cycle at 95 °C, 
followed by the addition of 1 |xl of Vent polymerase and thirty cycles of the following 

15 temperature pattern: 94 °C for one minute, 55 °C for one minute, 72 °C for one 
minute. 

The PCR product and the Cst-04 vector were subjected to a restriction 
digest. The PCR product digest included 16 jllI PCR rxn, 2 |xl 10X buffer, 0.5 jd 
Pvul, 0.5 \xl Seal, 1 Ml H 2 0. The pGEX-Ktext Kan r vector digest included 1 \xl 

20 pGEX-Ktext Kan r vector, 2 julI 10X buffer, 0.5 fxl Pvul, 0.5 pi Seal, 1 jlxI H 2 0. Both 
digests were incubated at 37 °C for 3 hours. Both the digested PCR fragment and the 
digested vector DNA were purified from 0.8% TAE agarose gels (Figure 1 A). 

The PCR product was then ligated into Cst-04 vector. The ligation 
reaction contained 7 fil gel-purified Kan r gene (cut with Sacl/Xbal), 1 fil gel-purified 

25 Cst-04 vector (cut with BamHE/EcoRI), 1 /xl 10X Ligation Buffer 1 \i\ T4 DNA 
ligase, and was incubated on ice overnight. The ligated PCR product was then 
transformed into the TGI competent cells. The transformation reaction conditions 
included 500 \i\ (thawed on ice) TGI Competent cells and Syd pGEX-KT-exT-kan r - 
CNS-Cstl ligation rxn. The cell/DNA mixture was incubated on ice for 30 minutes, 

30 and the cells were heat shocked at 42°C for 45 seconds and then incubated on ice 

again for 2 minutes. 500 jLtl LB broth was added and the mixture was shaken at 37°C 
for 1 hour. The transformation reactions were then plated on LB Kan r plates and 
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incubated 37°C overnight. The results of the transformation reactions are shown in 
Figure IB. 

Positive clones were screened for using the following method. Two milliliters 
of 9x LB/Kanamycin (10|xg/ml) culture was incubated with individual transformants 
5 at 37°C o/n using 250 RPM shaking. 1.5 milliliters of the overnight culture was 

transferred to an eppendorf tube to isolate plasmid DNA using Wizard Mini-Prep Kit 
(Qiagen, Valencia, CA). An insert-containing colony (Cst-04-Kan5, Figure 1C) was 
expanded in 100 ml of LB culture in order to isolate more plasmid DNA. 

The Cst-04-Kan5 plasmid-containing colony was screened for 

10 kanamycin and ampicillin resistance. Cst-04-Kan5 colony was streaked on both 
AFLB Kan r and Amp r plates, which were incubated overnight at 37 °C. Figure ID 
shows that, in colony Cst-04Kan5, the kanamycin resistance gene is active and the 
Ampicillin resistance gene is inactive. 

The Cst-04Kan5 plasmid was transfomed into E. coli JM109 cells 

15 using 20 \xl JM109 competent cells in 0.34 jul 1.42 M beta mercaptoethanol, incubated 
on ice for 10 minutes, then adding IjliI (100 ng) Cst-04-Kan5 plasmid. The cell/DNA 
mixture was incubated ice for 30 minutes, then heat shocked at 42 °C for 45 seconds. 
The reaction was then incubated on ice for 2 minutes, at which time 80 jutl SOC or LB 
was added. The reaction mixture was shaken at 37°C for 1 hour, then plated on LB 

20 Kan r plates. Identification and confirmation of the Cst-04-Kan5 plasmid DNA was 
carried out with a restriction enzyme digestion of plasmid DNA isolated from positive 
transformants, using Ndel, Sail, PstI restriction enzymes. The restriction fragment 
sizes were -7.3 kb for one cut, such as Ndel or Sail. Three bands (~1.7kb, ~2.2kb, 
and ~3.2kb) were observed when Cst-04-Kan5 DNA was cut with PstI (See Figure 

25 1C, lane 4). 

Starter cultures of Cst-04Kan5 plasmid-containing cells were produced 
and used to inoculate 100ml cultures for the generation of cell lysates. Centrifuged 
cell pellets resulting from the large-scale cultures were resuspended in 5ml H2O prior 
to lysis in a French press. The resultant lysate was centrifuged at 4°C, 18,000 RPM for 
30 20 minutes. The clarified lysate was subsequently used for activity analysis and the 
remainder of the lysate was stored at -20°C. 

The activity of the cell lysates was determined under the assay 
conditions illustrated in Table 1. 
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Table 1. 

Final Stock 
Reagent Concentration Concentration Amounts 



CTP 


ImM 


100 mM 


1 uj 


NAN 


ImM 


200 mM 


0.5 Ml 


LacPsy 


0.5 mM 


2.5 mM 


20 Ml 


MgC12 


50 mM 


1M 


5 Ml 


Tris pH 8 


lOOmM 


1M 


10 Ml 


Lysate 


15 % 


Crude 


15 Ml 


dH20 






63 Ml 


Total 






100 Ml 



reaction 
volume 

Table 1 lists the reagents, and concentrations and volumes thereof, used in the lysate 
5 activity assays. 

The lysate assay reagents were mixed and incubated at 37°C. Time points were taken 
at 0 minutes and 1 hour. A negative control (pGEX-Kt-exT-kan r vector without insert) 
was also included. All time points were analyzed using thin layer chromatography 
10 (CHCl3:CH 3 OH:H 2 O:NH4OH:60:40:5: 1 respectively). The plates were air dried, 
dipped in anisaldehyde and heated on a hot plate until the spots developed. Cst- 
04Kan5 lysate was also assayed for activity using lacto-N-neotetraose as substrate. 
The lacto-N-neotetraose substrate activity is illustrated in Figure IE. Activity of 
lysates from Cst-04Kan5 plasmid-containing cells was 8500 units/liter. 

15 

Example 2: Modification of the polvlinker of pCWori+ Kan r expression vector. 

pCWori+ Kan r (Cst-04Kan5) contains the genes encoding N. 
meningitidis CMP-NAN synthetase (CNS) and Campylobacter jejuni a2,3 Sialyl 
20 Transferase (CstI) at the multiple cloning site. This vector appears to give high levels 
of expression of recombinant proteins but is hard to use due to the limited number of 
restriction sites in the multiple cloning site (MCS). Therefore, the multiple cloning 
site was modified and expanded as described herein. 

The multiple cloning site starting at Ndel restriction site and extending 
25 to the start of the inactive Amp r gene from Cst04Kan5 was generated using PGR to 
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generate cDNA with modified multiple cloning sites at 5' (Pcwmcs (Ndel) - 
ATCGATCGACATATGGGATCCGAGCTCAAGCTTTCTAGACTCGAGGAATT 
CGGTACCGTCGACATCGATGATAAGCTGTCAAA) (SEQ ID NO:6) and 3' 
(Scal-ATTCCAAGTAGTACTACTCTTCCTTTTTCAA) (SEQ ID NO:7) ends of 
5 pCWINl construct. The PCR primers set for pre pCWIN2 construct are 5' (Bgl II- 
CAATTATATAGATCTATCGATGCTTAGGAGGT) (SEQ ID NO:8) and 3* (Cstl- 
Xba-TTGCCTTATTCTAGATCATTAGTGGTGATGGTGGTG) (SEQ ID NO:9). 
The PCR products were then subcloned into Cst04kan5 (pCWori+ kan r -CNS-CstI) 
vector, transformed into TGI cells, and screened for the correct construct. 

10 Two PCR reactions were conducted, using 10 ng Cst04Kan5 cDNA as 

a template. The first reaction contained 1 \ig (1 |il) Pcwmcs/Scal-pcw primer, 78 |nl 
H 2 0, 8 (xl dNTP mixture, 10 jlxI 10X buffer, and 1 jutl Vent polymerase. The second 
reaction contained 1 \ig (1 pi) 5'pcBglH/Cstl-Xba primer, 78 julI H 2 0, 8 jlxI dNTP 
mixture, 10 |Lil 10X buffer, and 1 Jul Vent polymerase. The PCR reaction parameters 

15 included a 5 minute cycle at 95 °C, followed by the addition of 1 jutl of Vent 

polymerase and thirty cycles of the following temperature pattern: 94 °C for one 
minute, 55 °C for one minute, 72 °C for one minute. 

The PCR products were subjected to a restriction digest. The first PCR 
reaction product ("pCWINl" insert) digest included 16 ^1 PCR rxn, 2 \i\ 10X buffer, 

20 0.5 \i\ Ndel, 0.5 |il Seal, 1 |Lil H 2 0, and the second PCR reaction product ("pre 

pCWIN2" insert) digest included 16 ^ PCR rxn, 2 jxl 10X buffer, 0.5 \il BglH, 0.5 jil 
EcoRI, 1 \il H 2 0. A pCWori Kan r Cst04Kan5 vector was prepared for insertion of the 
first PCR product by incubation of 2 jixl (1 \ig) Cst04Kan5 vector, 2 (xl 10X buffer, 0.5 
|Ltl Ndel, 0.5 |Lil Seal, 1 |Lil H 2 0. A pCWori Kan r Cst04Kan5 vector was similarly 

25 prepared for insertion of the second PCR product by incubation of 2 jjJ (1 jag) 
Cst04Kan5 vector, 2 |ml 10X buffer, 0.5 jlxI BamHI, 0.5 \xl EcoRI, 1 |Lil H 2 0. 

The digested PCR fragments and digested vectors were purified from 
0.8% TAE agarose gels (Figure 2A). The PCR products were then subcloned into Cst- 
04kan5 vectors by ligation (Table 2) and electroporated into the TGI competent cells. 

30 Electroporation reactions included 30 \xl thawed (on ice) TGl/DH5a 

electrocompetent cells and 3 jlxI ligation reaction mixture. The DNA/cell 
electroporation mixture was transferred to a chilled cuvette, and the cells were 
subjected to electroporation using pulses of 2.5 KV, R5 resistance, and 129 ohms. 0.9 
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ml SOC media was then added to the reaction mixture, and the entire culture was 
incubated at 37 °C for one hour, at which time the electoporation product was 
incubated overnight after plating on LB agar plates containing 50 ng/ml Kan r . 

5 Table 2 



1. pCWINl 

Gel-purified pCWINl insert (digested A) 7 fil 

Gel-purified Cst-04kan5 vector (digested B) 1 fil 

10X Ligation Buffer 1 fil 

10 T4 DNA Ligase 1 /xl 

2. pre pCWIN2 

Gel-purified pre pCWIN2 insert (digested C) 7 pi 

Gel-purified Cst-04kan5 vector (digested D) 1 fil 

15 10X Ligation Buffer 1 fil 

T4 DNA Ligase 1 fil 

3. pCWIN2 

Gel-purified pCWINl#5 insert (digested E) 7 fil 
20 Gel-purified pre pCWIN2#l 1 vector (digested F) 1 fil 

10X Ligation Buffer 1 fil 

T4 DNA Ligase 1 fil 



Table 2 illustrates the ligation reaction conditions for pCWINl, pre-pCWIN2, and 
25 pCWIN2 PCR reaction products. Both pCWINl and pre pCWIN2 ligations were 
incubated at 4°C overnight. 

Screening of transformants for positive clones was then conducted. 
Five colonies pCWINl -containing TGI cells were selected, as were 9 colonies of pre- 

30 pCWin2 in DH5oc, and 18 colonies of pCWin2 in TGI. Each was placed into 2 ml 
TB/kanamycin (50|Ag/ml) and incubated at 37°C for 5 hours with shaking at 250 
RPM. 1.5 milliliters of each culture was transferred to an eppendorf tube to isolate 
plasmid DNA using Wizard Plus Mini-Prep Kit (Qiagen, Valencia, CA). Each 
plasmid DNA preparation was subjected to restriction digestion with the appropriate 

35 restriction endonucleases as described above. The digestion reactions were then 
analyzed on agarose/TAE gels (Figures 2B - 2D). 

One colony of pCWinl (colony #5) and one colony of pre pCWin2 
(colony #11) contained the appropriate sized inserts. Specifically, the expected size 
for the pCWinl insert was 5 kb and the expected size for the pre-pCWin2 insert was 7 

40 kb (Figure 2B). pCWinl and pre pCWin2 plasmid DNA were digested with Ndel and 
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Seal. Ndel and Seal digestion o£ pCWinl plasmid generated bands of 750 bp and 
4.2kb, and the same digestion of pre-pCWin2 plasmid generated bands of 2.8 kb and 
4.2 kb (Figure 2C). Figure 2D illustrates the result of ligating the 750 bp of pCWinl 
with the 4.2 kb fragment of pre-pCWin2 to generate the pCWin2 expression vector. 
5 The difference between pCWinl and pCWin2 expression is that pCWinl has two 
BamHl sites, one in being in the tac promoter and the other being in the multiple 
cloning site (down stream of Ndel). pCWin2 has only one BamHl site, in the 
multiple cloning site, and the BamHl that resided in the tac promoter was destroyed. 

10 Example 3: Addition of Maltose Binding Protein to pCWin2 Kan r Expression Vector 

The E.coli malE gene, encoding a maltose binding protein, was 
subcloned into the pCWin2 kan r bacterial expression vector. The malE gene was PCR 
amplified from pMal-c2X, ligated into the multiple cloning site of pCWin2 kan r , and 
15 subsequently transformed into electrocompetent DH5a E. colu The final product, a 
pCWin2MBP kan r bacterial fusion tag expression vector, was created as described 
below. 

Restriction endonuclease digestion of pCWin2 kan r and pMAL-c2X 
amp r was conducted to prepare the malE maltose binding protein cDNA and the 

20 pCWin2 vector cDNA for insertion of the malE cDNA into the pCWin2 vector. 

Digestion of the malE cDNA was conducted using 2 ill of pMAL -C2X vector DNA 
(1 jtig//d), 2 fil 10X BamHl NEbuffer, 2 fil 10X purified BSA, 1 /il Ndel, 1 fil BamHl, 
and 12 fil dH 2 0. Digestion of the vector was conducted using 2 ill pCWin2 vector 
DNA 0.8/xg/^l, 2 jullOX BamHl NEbuffer, 2 fil 10X purified BSA, 1 fil Ndel, 1 /xl 

25 BamHl, and 12 fil dH 2 0. 

The restriction enzyme digestions were incubated at 37°C for two 
hours. The reactions were stopped by adding 3 fil Blue/Orange 6x Agarose Loading 
Dye. The digestions were then loaded onto separate 0.7% agarose/TAE gels, and 
electrophoresed at 135 volts until the dyes migrated to the lower third of the gel. An 

30 image of the pCWin2 vector digestion agarose gel was captured using a digital 
camera (Figure 3B). An image of the polyacrylamide gel containing the purified 
product from the digestion of malE is shown in Figure 3A. 

The linearized pCWin2 kan r and malE fragments were gel purified. 
Using the UV box to illuminate the DNA, the bands of DNA were excised using a 
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sterile scalpel. The pCWin2 kan r DNA was approximately 5 kb, and the malE gene 
was approximately 1.2 kb. The excised agarose wedges were placed into Ultrafree DA 
agarose extraction spinfilters (Millipore, Bellerica, MA), and microcentrifuged at 
10,000 xg for 5 minutes. The filtrates were transferred to YM-100 spinfilters 
5 (Millipore, Bellirica, MA), and the DNA was washed by adding 300 fil dH 2 0. The 
spinfilters were centrifuged at 500 xg for 15 minutes, and the wash step was repeated 
two additional times. The last wash step concentrated the DNA to an approximate 
volume of 25 jil, at which time the column was inverted into another autoclaved 1.7 
mL eppendorf tube. The DNA retentate was collected by microcentrifuging the 

10 eppendorf at lOOOxg for one minute. 

The ligation of gel purified malE and linearized pCWin2 kan r was 
performed in an autoclaved 0.5 mL eppendorf microcentrifuge tube. The ligation 
reaction included 7 /xl of purified malE DNA that was digested with Ndel and BamHI, 
1 /xl of linearized pCWin2 kan r that was digested with Ndel and BamHI, 1 fi\ 10X 

15 ligase buffer, 1 fil T4 DNA ligase. The ligation reaction was incubated at room 
temperature for three hours. In the vector control ligation reaction, dH^O was 
substituted for the malE DNA. 

The ligation reactions were transformed into electrocompetent.DHSa 
E.colL After a three hour ligation incubation, one aliquot of electrocompetent DH5a 

20 E.coli was removed from a -81°C freezer, and placed on ice to thaw. 20 fil of the 
cells was aliqouted into chilled, autoclaved 1.7 mL microcentrifuge tubes, and then 
one microliter from each ligation reaction was added to the cells. Immediately, the 
reactions were transferred to chilled electroporation cuvettes. The cells were 
electroporated with a 2.5kV 6 msec pulse as described in the manufacturer's 

25 instructions. Then one milliliter of AFLB SOC media was added to the 

transformation reactions, and the entire volume was transferred to an autoclaved 1.7 
mL microcentrifuge tube. The transformation reactions were incubated at 37°C for 
one hour with shaking at 250 rpm. After incubating the cells for an hour, 100 jd 
from each transformation reaction was plated by spreading onto LB agar kan r plates. 

30 The plates were incubated at 37°C overnight. 

Results from the ligation and transformation reactions are as follows. 
The pCWin2 MBP vector plating resulted in thirteen colonies, and out of ten colonies 
selected, nine were positive for the recombinant vector. The pCWin2 vector control 
plating did not contain any colonies. Colonies from the pCWin2 MBP vector LB agar 
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plates were selected and used to inoculate 2 ml AFLB kan r . The starter cultures were 
grown overnight at 37°C, with shaking at 250 rpm. Plasmid DNA was isolated from 
the transformants and screened for the correct malE insert by a double digestion with 
Ndel and BamHI restriction endonucleases. 
5 Restriction digestion of miniprep DNA was conducted in a reaction 

mixture containing 12 fil Miniprep DNA, 1 /xl Ndel endonuclease, 1 fil BamHI 
endonuclease, 1.5 fil lOXBamHI NEBuffer, and 1 pi 10X purified BSA. The 
digestion reactions were mixed and incubated at 37°C for one hour. After incubation, 
3 /xl of 6x Agarose Gel Loading Dye was added to each restriction digestion. The 

10 restriction digestions were then loaded into the wells of a 0.7% agarose/TAE gel. The 
samples were electrophoresed at 135 volts until the dye migrated to the lower third of 
the gel. The gel was then removed from the gel box, and the image captured with a 
digital camera (Figure 3C). 

, Large scale purified pCWin2MBP vector DNA was isolated from 

15 transformant #1 using the HiSpeed Plasmid Maxi Kit (Qiagen, Valencia, CA). A 2 
raL AFLB kan r starter culture was inoculated with 10 ftl pCWin2MBP DH5a E.coli 
overnight culture. This starter culture was grown overnight in a 37°C incubator, with 
shaking at 250 rpm. The overnight starter culture was used to inoculate two 125 mL 
AFLB kan r cultures, and these larger scale preps were grown overnight at 37°C with 

20 shaking at 250 rpm. DNA from the large scale preparation was used for sequencing of 
the malE insert subcloned between the Ndel and BamHI restriction sites (MWG 
Biotech, High Point, NC). The sequence of the vector is set forth in SEQ ID NO:3. 

Example 4: Preparation and Characterization of pCWin2-MBP-SBDWpMSW) Vector 
25 The pMS 39 Kan R expression vector was created from the pCWIN2- 

MBP- SBD-ST3 Gal HI (GalBl,3(4)GlcNAc a2,3-Sialyltransferase) A73 construct, 
removing the the ST3 Gal m A73 gene and replacing it with the Multiple Cloning site 
(MCS) from the pcWIN2 vector. Selection was of final construct was determined by 
restriction enzyme analysis with Nde I and Xba I (there is no Xba I site the pCWIN2- 
30 MBP-SBD ST3 Gal IH construct) digestion and sequence confirmation. The final 
construct was designated the pCWin2-MBP-SBD (pMS 39 ) Kan r expression vector. 
The several steps of the preparation of this vector are detailed in Figures 7A-7G. 
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Example 5: Preparation and Characterization of the pCWin2-MBP-MCS-SBD 
(pMXS^) Vector 

The pCWin2-MBP-MCS-SBD (pMXS 3 9> expression vector was constructed 
according to the following method. The Starch Binding Domain (SBD) insert was 
5 isolated by PCR using the 5' primer ( XhoI-SBD-39-5' 

TGTATCCTCGAGATTGTGGCGACCGGCGGCACCAC) (SEQ ID NO: 12) and the 
3' primer (3' Sall-AAGCTTGTCGACTCATTAGCGCCAGGTATCGGTCACGG) 
(SEQ ID NO: 13). The PCR products were gel purified and ligated into PCR-Blunt 
vector. The correct SBD insert (in the PCR-Blunt vector) was digested with Xhol and 
10 Sail, subcloned into Xhol-Sall digested pCWin2-MBP kan r vector, transformed into 
TBI cells and screened for the correct construct. The several steps of the preparation 
of this vector are detailed in Figures 8A-8E. 



The disclosures of each and every patent, patent application, and publication 
15 cited herein are hereby incorporated herein by reference in their entirety. While the 
invention has been disclosed with reference to specific embodiments, it is apparent 
that other embodiments and variations of this invention may be devised by others 
skilled in the art without departing from the true spirit and scope of the invention. 
The appended claims are intended to be construed to include all such embodiments 
20 and equivalent variations. 
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